Approximation Algorithms for Multiple Sequence Alignment Under a Fixed Evolutionary Tree
Authors
Abstract
We consider the problem of multiple sequence alignment under a fixed evolutionary tree: given a tree whose leaves are labeled by sequences, find ancestral sequences to label its internal nodes so as to minimize the total length of the tree, where the length of an edge is the edit distance between the sequences labeling its endpoints. We present a new polynomial-time approximation algorithm for this problem and analyze its performance on regular d-ary trees with d a constant. On such a tree, the algorithm finds a solution within a factor (d + 1)/(d - 1) of the minimum, in time O(k T(d, n)) plus a term polynomial in k and n, where k is the number of leaves in the tree, n is the length of the longest sequence labeling a leaf, and T(d, n) is the time to compute a Steiner point for d sequences of length at most n. A Steiner point for a set S of sequences is a sequence P that minimizes the sum of the edit distances from P to each sequence in S. The time T(d, n) is bounded in terms of d and n, given preprocessing whose cost depends on d and the alphabet size s. The approximation algorithm is conceptually simple and easy to implement, and it applies to any metric space in which a Steiner point for any fixed-size set can be computed in polynomial time. We also introduce a new problem, bottleneck tree alignment, in which the objective is to label the internal nodes of the tree so as to minimize the length of the longest edge. We describe an exponential-time exact algorithm for the case of unit-cost edit operations, and show that there is a simple linear-time approximation algorithm for the general case that finds a solution within a factor O(log k) of the minimum.
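The three objectives named in the abstract (total tree length, Steiner-point cost, and bottleneck length) can be made concrete with a minimal Python sketch. It only evaluates a given labeling and is not the approximation algorithm of the paper. It assumes unit-cost edit operations, and the function names, the edge-list tree representation, and the toy sequences are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def edit_distance(a: str, b: str) -> int:
    """Unit-cost edit distance: insert, delete, substitute each cost 1 (assumption)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


def steiner_cost(p: str, seqs: Sequence[str]) -> int:
    """Sum of edit distances from candidate p to every sequence in seqs.
    A Steiner point is a p that minimizes this value."""
    return sum(edit_distance(p, s) for s in seqs)


def edge_lengths(edges: List[Tuple[str, str]], labels: Dict[str, str]) -> List[int]:
    """Length of each tree edge under the given node labeling."""
    return [edit_distance(labels[u], labels[v]) for u, v in edges]


def tree_length(edges: List[Tuple[str, str]], labels: Dict[str, str]) -> int:
    """Tree alignment objective: total length of all edges."""
    return sum(edge_lengths(edges, labels))


def bottleneck_length(edges: List[Tuple[str, str]], labels: Dict[str, str]) -> int:
    """Bottleneck tree alignment objective: length of the longest edge."""
    return max(edge_lengths(edges, labels))


# Toy star tree: one internal node "r" with three leaves (hypothetical data).
edges = [("r", "x"), ("r", "y"), ("r", "z")]
labels = {"x": "ACGT", "y": "AGT", "z": "ACT", "r": "ACGT"}

print(tree_length(edges, labels))        # total edge length for this labeling
print(bottleneck_length(edges, labels))  # longest edge for this labeling
print(steiner_cost("ACGT", ["ACGT", "AGT", "ACT"]))
```

On a star tree the total length of a labeling equals the Steiner cost of the root label with respect to the leaves, which is why the first and last printed values agree here.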
Related articles
Application of Evolutionary Algorithms for Multiple Sequence Alignment
Multiple Sequence Alignment is a crucial task in Bioinformatics. Most of the commonly used multiple alignment methods are based on a dynamic programming approach. This approach however requires time proportional to the product of the sequence lengths and also doesn’t provide an extensible platform for evaluating different objective functions. Tree-based algorithms, which combine results from pa...
An Application of the ABS LX Algorithm to Multiple Sequence Alignment
We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...
Improved Approximation Algorithms for Tree Alignment
Multiple sequence alignment is a task at the heart of much of current computational biology [4]. Several different objective functions have been proposed to formalize the task of multiple sequence alignment, but efficient algorithms are lacking in each case. Thus multiple sequence alignment is one of the most critical, essentially unsolved problems in computational biology. In this paper we ...
gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences
Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...
Fixed Topology Alignment with Recombination
In this paper, we study a new version of multiple sequence alignment, fixed topology alignment with recombination. We show that it cannot be approximated within any constant ratio unless P = NP. For a more restricted version, we show that the problem is MAX-SNP-hard. This implies that there is no PTAS for this version unless P = NP. We also propose approximation algorithms for a special case, wh...
Journal title:
- Discrete Applied Mathematics
Volume 88, Issue
Pages -
Publication date: 1995